(54) Method and apparatus for segmenting data to create mixed raster content planes

(57) An improved technique for compressing a color or gray scale pixel map representing a document using an MRC format includes a method of segmenting an original pixel map into two planes (12, 16), and then compressing the data on each plane in an efficient manner. The image is segmented by separating the image into two portions at the edges. One plane contains image data for the dark sides of the edges, while image data for the bright sides of the edges and the smooth portions of the image are placed on the other plane. This results in improved image compression ratios and enhanced image quality.
Description

[0001] This invention relates generally to image processing and, more particularly, to techniques for segmenting, classifying and/or compressing the digital representation of a document.

[0002] Documents scanned at high resolutions require very large amounts of storage space. Instead of being stored as is, the data is typically subjected to some form of data compression in order to reduce its volume, and thereby avoid the high costs associated with storing it. "Lossless" compression methods such as Lempel-Ziv Welch (LZW) do not perform particularly well on scanned pixel maps. While "lossy" methods such as JPEG work fairly well on continuous-tone pixel maps, they do not work particularly well on the parts of the page that contain text. To optimize image data compression, techniques that can recognize the type of data being compressed are needed.

[0003] Known compression techniques are described in US-A-5778092, US-A-5251271, US-A-5060980, US-A-5784175, US-A-5303313 and US-A-5432870.

[0004] In one embodiment, the present invention discloses a method of segmenting a pixel map representation of a document which includes the steps of: acquiring a block of the digital image data, wherein the digital image data is composed of light intensity signals in discrete locations; designating a classification for the block and providing an indication about a context of the block; segmenting the light intensity signals in the block into an upper subset and a lower subset based upon the designated classification; generating a selector set which tracks the light intensity segmentation; and separately compressing the digital image data contained in the upper and lower subsets.

[0005] In another embodiment, the present invention discloses a method of classifying a block of digital image data into one of a plurality of image data types, wherein the block of data is composed of light intensity signals in discrete locations, which includes: dividing the block into a bright region and a dark region; dividing a low pass filtered version of the block into a bright region and a dark region; calculating average light intensity values for each of the bright region, the dark region, the filtered bright region and the filtered dark region; and comparing a difference between the bright region and the dark region average light intensity values to a filtered difference between the bright region and the dark region average filtered light intensity values; if the average light intensity difference and the average filtered light intensity difference are approximately equal, finding a range of values in which the difference value falls, and classifying the block based upon the value range; and if the average light intensity difference and the average filtered light intensity difference are not approximately equal, finding a range of values in which the filtered difference value falls and classifying the block based upon the filtered value range.



[0006] Some examples of methods according to the present invention will now be described with reference to the accompanying drawings, in which:

Figure 1 illustrates a composite image and includes an example of how such an image may be decomposed into three MRC image planes: an upper plane, a lower plane, and a selector plane;
Figure 2 contains a detailed view of a pixel map and the manner in which pixels are grouped to form blocks;
Figure 3 contains a flow chart which illustrates generally the steps performed to practice the invention;
Figure 4 contains a detailed illustration of the manner in which blocks may be classified according to the present invention;
Figure 5 contains a detailed illustration of the manner in which blocks may be segmented based upon their classification according to the present invention;
Figure 6 contains the details of one embodiment of the manner in which block variation can be measured as required by the embodiment of the invention shown in Figure 4;
Figure 7 contains the details of an embodiment of the invention describing classification of blocks based upon the block variation measurement provided in Figure 6;
Figure 8 contains the details of an embodiment of the invention for which context may be updated based upon the block classification provided in Figure 7; and
Figure 9 contains the details of another embodiment of the invention for updating context based upon block classification as provided in Figure 7.

[0007] The present invention is directed to a method and apparatus for separately processing the various types of data contained in a composite image. While the invention will be described in a Mixed Raster Content (MRC) technique, it may be adapted for use with other methods and apparatus and is not, therefore, limited to an MRC format. The technique described herein is suitable for use in various devices required for storing or transmitting documents such as facsimile devices, image storage devices and the like, and processing of both color and grayscale black and white images is possible.

[0008] A pixel map is one in which each discrete location on the page contains a picture element or "pixel" that emits a light signal with a value that indicates the color or, in the case of gray scale documents, how light or dark the image is at that location. As those skilled in the art will appreciate, most pixel maps have values that are taken from a set of discrete, non-negative integers.
[0009] For example, in a pixel map for a color document, individual separations are often represented as digital values, often in the range 0 to 255, where 0 represents no colorant (i.e. when CMYK separations are used), or the [...] chrominance separations [...], and 255 represents [...] value [...]. In a gray scale pixel map this typically translates to pixel values which range from 0, for black, to 255, for the whitest tone possible. The pixel maps of concern in the currently preferred embodiment of the present invention are representations of "scanned" images, that is, images which are created by digitizing light reflected off of physical media using a digital scanner. [...] pixels can take one of two values, 1 or 0.

[0010] Turning now to the drawings for a more detailed description of the MRC format, pixel map 10 representing a color or gray-scale document is preferably decomposed into a three plane page format as indicated in Figure 1. [...] 10 are preferably [...] blocks 18 (best illustrated in Figure 2) to allow for better image processing efficiency. The [...] format is typically comprised of an upper plane 12, a lower plane 14 and a selector plane 16. Upper plane 12 and lower plane 14 contain pixels that describe the [...], wherein pixels in each block 18 [...]. For example, pixels that have values above a certain threshold may be placed on one plane, while those with values that are equal to or below the threshold are [...] to an exact spot on either upper plane 12 or lower plane 14.

[0011] The upper and lower planes are stored at the same bit depth and number of colors as the original pixel map 10, but possibly at reduced resolution. Selector plane 16 is created and stored as a bitmap. It is important to recognize that while the terms "upper" and "lower" are used to describe the planes on which data resides, it is not intended to limit the invention to any particular arrangement or configuration.
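
The relationship between the three planes can be illustrated with a small sketch. It assumes, purely for illustration, that each plane is a list of rows of gray values and that a selector bit of 1 picks the upper plane; the text does not fix that polarity, and the function name `recompose` is hypothetical.

```python
def recompose(upper, lower, selector):
    """Rebuild a pixel map from upper/lower planes and a binary selector.

    Assumption (not stated in the text): selector value 1 means "take the
    pixel from the upper plane", 0 means "take it from the lower plane".
    """
    return [
        [u if s == 1 else l for u, l, s in zip(u_row, l_row, s_row)]
        for u_row, l_row, s_row in zip(upper, lower, selector)
    ]


# Hypothetical 2x2 planes: dark data on the upper plane, bright on the lower.
upper = [[10, 10], [10, 10]]
lower = [[200, 200], [200, 200]]
selector = [[1, 0], [0, 1]]
image = recompose(upper, lower, selector)
```

Because the selector is binary, it can be stored as a bitmap, which is what makes a lossless bi-level coder a natural fit for it.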
[0012] After processing, all three planes are compressed [...] thereon. For example, upper plane 12 and lower plane 14 may be compressed and stored using a lossy [...] plane 16 is compressed and stored using a lossless compression format such as gzip or CCITT-G4. It would be apparent to one of skill in the art to compress and store [...] use of the output document. For example, [...] preferably be used [...] approved [...].

[0013] In the present invention, digital image data is preferably processed using an MRC technique such as described above. Pixel map 10 represents a scanned image composed of light intensity signals dispersed throughout the separation at discrete locations. Again, a light signal is emitted from each of these discrete locations, referred to as "picture elements," "pixels" or "pels," at an intensity level which indicates the magnitude of the light being reflected from the original image at the corresponding location in that separation.
[0014] In typical MRC fashion, pixel map 10 must be partitioned into two planes 12 and 14. Figure 3 contains a schematic diagram which outlines the overall process used to segment pixel map 10 into an upper plane 12 and a lower plane 14 according to the present invention. Block 18 is acquired as indicated in step 210, and is classified as indicated in step 220. In the preferred embodiment of the invention, block 18 will initially be classified as either UNIFORM, SMOOTH, WEAK EDGE or EDGE, and its context, either TEXT or PICTURE, will be provided. The block will then be reclassified as either SMOOTH or EDGE, depending upon the initial classification and the context. Next, pixels in block 18 are segmented, that is, placed on either upper plane 12 or lower plane 14 according to the criteria that are most appropriate for the manner in which the block has been classified, as indicated in step 230. This process is repeated for each block 18 in original pixel map 10 until the entire pixel map 10 has been processed. Upper plane 12, lower plane 14 and selector plane 16 are then separately compressed, using a technique that is most suitable for the type of data contained on each, as indicated in step 240.
[0015] Turning now to Figure 4, generally speaking, classification of blocks 18 into one of the four categories in step 220 as described above is preferably completed in three steps. First, the variation of pixel values within the block is determined as indicated in step 310. Block variation is best determined by using statistical measures, which will be described in detail below with reference to Figure 6. Blocks with large variations throughout are most likely to actually lie along edges of the image, while those containing little variation probably lie in uniform or at least smooth areas. Measuring the variations within the block allows an initial classification to be assigned to it as indicated in step 320. Next, image data within each block 18 is reviewed in detail to allow context information (i.e. whether the block is in the text or picture region of the image) to be updated and any necessary block re-classifications to be performed as shown in step 330. The UNIFORM blocks are reclassified as SMOOTH, and the WEAK EDGE blocks are upgraded to EDGE in a TEXT context or reclassified as SMOOTH in a PICTURE context. A smoothed version 20 of the image is also provided by applying a low pass filter to the pixel map 10. Smoothed image 20 is used in conjunction with original image data to offer additional information during classification, and also provides unscreened data for halftone regions.
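
The re-classification rule of step 330 is small enough to write out directly. The following is a minimal sketch; the string labels mirror the class and context names used in the text, while the function name `reclassify` and the choice to raise on unknown labels are illustrative assumptions.

```python
def reclassify(initial_class, context):
    """Map an initial class plus a context to the final SMOOTH/EDGE class.

    UNIFORM always becomes SMOOTH; WEAK EDGE becomes EDGE in a TEXT
    context and SMOOTH in a PICTURE context; SMOOTH and EDGE are kept.
    """
    if context not in ("TEXT", "PICTURE"):
        raise ValueError(f"unknown context: {context!r}")
    if initial_class == "UNIFORM":
        return "SMOOTH"
    if initial_class == "WEAK EDGE":
        return "EDGE" if context == "TEXT" else "SMOOTH"
    if initial_class in ("SMOOTH", "EDGE"):
        return initial_class
    raise ValueError(f"unknown class: {initial_class!r}")


final_in_text = reclassify("WEAK EDGE", "TEXT")
final_in_picture = reclassify("WEAK EDGE", "PICTURE")
```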
[0016] Figure 5 contains details of the manner in which block 18 is segmented into two planes, as provided in step 230 of Figure 3. The process begins by first determining at step 410 whether the block being processed has initially been classified as an EDGE in step 220. If so, the value v_p of each pixel in the block is first compared to a brightness threshold value t_s, wherein pixels that have values equal to or above t_s are viewed as "bright" pixels, while those with values below t_s are "dark" pixels. Segmenting EDGE blocks simply includes placing dark pixels on upper plane 12 as indicated in step 440, and placing bright pixels on lower plane 14 as indicated in step 450. If it is determined at step 410 that block 18 is not an EDGE, all pixels in the block are processed together, rather than on a pixel by pixel basis. Segmenting of SMOOTH (non-EDGE) blocks occurs as follows: if block 18 is in the midst of a short run of blocks that have been classified as SMOOTH, and further, all blocks in this short run are dark (v_p < t_s), all data in the block is placed on upper plane 12. If the entire block 18 is substantially smooth (i.e. in a long run) or is bright (in a short run of bright pixels), all data in block 18 is placed on lower plane 14.
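
The segmentation of a single block can be sketched as follows. This is an illustration under stated assumptions rather than the patented implementation: a block is a list of rows of gray values, `None` marks a position that holds no data on a plane, a selector bit of 1 means "upper plane", and the caller supplies a hypothetical flag `in_short_dark_run` because detecting short runs of dark SMOOTH blocks requires information from neighboring blocks.

```python
def segment_block(block, is_edge, threshold, in_short_dark_run=False):
    """Split one block into (upper, lower, selector).

    EDGE blocks are split pixel by pixel: dark pixels (value < threshold)
    go to the upper plane, bright pixels to the lower plane.
    Non-EDGE blocks move as a whole: to the upper plane only when the
    block sits in a short run of dark SMOOTH blocks, otherwise to the
    lower plane.
    """
    upper, lower, selector = [], [], []
    for row in block:
        u_row, l_row, s_row = [], [], []
        for value in row:
            to_upper = (value < threshold) if is_edge else in_short_dark_run
            u_row.append(value if to_upper else None)
            l_row.append(None if to_upper else value)
            s_row.append(1 if to_upper else 0)  # assumed polarity: 1 = upper
        upper.append(u_row)
        lower.append(l_row)
        selector.append(s_row)
    return upper, lower, selector


edge_block = [[10, 200], [220, 30]]
upper, lower, selector = segment_block(edge_block, is_edge=True, threshold=128)
```

In a real MRC encoder the empty positions would normally be filled with values that compress well; the text does not describe that step, so it is left out here.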
[0017] Turning now to Figure 6, the details of one embodiment of the invention wherein initial block classification via block variation measurement may be accomplished as required by step 310 (Figure 4) are now described. A threshold, t_s, which allows the block to be divided into two portions is first calculated as indicated in step 510. In the preferred embodiment of the invention, this threshold is obtained by performing a histogram analysis on the data in the block, but many standard methods can be used to perform this analysis. For example, the value that maximizes between distances of the criteria being used for separation or provides for maximum separation between the two portions of the block can be selected. Those skilled in the art will recognize that other methods of choosing the best threshold are available and the invention is not limited to this embodiment. Block 18 is then thresholded into these two parts by comparing the light intensity value of each pixel to the selected threshold t_s, as indicated in step 520. As before, if the pixel value v_p is less than the threshold, the pixel is referred to as dark. If v_p is greater than or equal to t_s, the pixel is bright.
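
The text leaves the threshold selection method open. As one possible illustration, the sketch below uses an Otsu-style criterion (maximizing the between-class variance of the dark and bright groups), which is a standard histogram technique substituted here as an assumption; it is not claimed to be the analysis used in the preferred embodiment. The function name `select_threshold` is hypothetical.

```python
def select_threshold(pixels):
    """Return a threshold t so that dark = v < t and bright = v >= t.

    Candidates are the distinct pixel values except the smallest, so both
    groups are always non-empty. The candidate with the largest
    between-class variance w0 * w1 * (m0 - m1)**2 wins.
    Returns None when every pixel has the same value.
    """
    n = len(pixels)
    best_t, best_score = None, -1.0
    for t in sorted(set(pixels))[1:]:
        dark = [v for v in pixels if v < t]
        bright = [v for v in pixels if v >= t]
        w0, w1 = len(dark) / n, len(bright) / n
        m0, m1 = sum(dark) / len(dark), sum(bright) / len(bright)
        score = w0 * w1 * (m0 - m1) ** 2
        if score > best_score:
            best_t, best_score = t, score
    return best_t


block_pixels = [10, 12, 11, 200, 205, 210]  # hypothetical flattened block
t_s = select_threshold(block_pixels)
```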
[0018] As stated earlier, a smooth version 20 of the image is obtained by applying a low pass filter to the original image data. Average values for bright and dark pixels are then obtained for both the original and smoothed sets of image data. Looking first at the bright pixels, one value calculated will be v_BPIXEL, the average value for all of the bright pixels in original pixel map 10 (v_p ≥ t_s) which are located in the area covered by block 18, as indicated in step 540. Another value, v_BSMOOTH, the average value for all of the bright pixels in smoothed version 20 of the image which are located in the area covered by block 18, will also be obtained as shown in step 560. Dark values are calculated similarly. That is, v_DPIXEL, the average value for all of the dark pixels in original pixel map 10 (v_p < t_s) which are located in the area covered by block 18, will be obtained as shown in step 550, and v_DSMOOTH, the average value for all of the dark pixels in the smoothed version 20 of the image which are located in the area covered by block 18, will be obtained as in step 570. Once these average values are obtained, the distances d and d_s between brighter and darker averages for pixel map 10 and smoothed image 20 respectively are calculated as indicated in step 580. That is, d = v_BPIXEL - v_DPIXEL and d_s = v_BSMOOTH - v_DSMOOTH. Since d/d_s is typically almost equal to 1 for contone images, the ratio d/d_s may be used to detect halftones.
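
A short numeric sketch shows why the ratio d/d_s separates halftones from contones. It assumes that the original and the smoothed block are each divided into bright and dark pixels with the same threshold t_s, and that the distance is taken as 0 when one of the two groups is empty; both choices are assumptions, as are the sample values and the helper name `bright_dark_distance`.

```python
def bright_dark_distance(pixels, t_s):
    """Average of bright pixels (v >= t_s) minus average of dark pixels."""
    dark = [v for v in pixels if v < t_s]
    bright = [v for v in pixels if v >= t_s]
    if not dark or not bright:
        return 0.0  # assumed convention for a one-sided block
    return sum(bright) / len(bright) - sum(dark) / len(dark)


t_s = 128
original = [0, 255, 0, 255]      # strong dot pattern, as in a halftone
smoothed = [120, 135, 120, 135]  # hypothetical low pass filtered values

d = bright_dark_distance(original, t_s)    # v_BPIXEL - v_DPIXEL
d_s = bright_dark_distance(smoothed, t_s)  # v_BSMOOTH - v_DSMOOTH
ratio = d / d_s
```

Low pass filtering flattens the halftone dots, so d_s collapses while d stays large and the ratio moves far from 1. For a contone block, filtering changes the averages only slightly and the ratio stays close to 1.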

[0019] Figure 7 contains a detailed illustration of step 320 of Figure 4, the preferred embodiment of a process for initially classifying blocks 18. As shown, a relative comparison between d and d_s is obtained as indicated in step 610 in order to determine whether the block contains contone (d ≈ d_s) or halftone data. Block 18 will initially be classified as one of four types, UNIFORM, SMOOTH, WEAK EDGE or EDGE, according to the magnitude of the distance d or d_s. Distance d is used to classify contone blocks, while distance d_s is used for halftones. For contone data, d, the value from pixel map 10, is compared to value x_0 as shown in step 620.

[0020] If d is very low (i.e. d < x_0), all pixel values in the block are substantially the same and the block is classified as UNIFORM at step 640. If there are somewhat small differences in pixel values in the block such that x_0 < d < x_1 as shown in step 622, the block is classified as SMOOTH at step 650. If there are fairly large differences in pixel values in the block and x_1 < d < x_2 at step 624, the block will be classified as WEAK EDGE. If the differences in the block are very large and d > x_2 at step 624, the block will be classified as an EDGE at step 670.

[0021] If d/d_s is not approximately equal to 1, d_s is compared to threshold y_0 at step 630. It should be noted here that two different sets of thresholds are applied for halftones and contones. Thus, on most occasions, x_0 ≠ y_0, x_1 ≠ y_1, and x_2 ≠ y_2. The process used to classify halftone blocks is similar to that used for contone data. Thus, if d_s < y_0 at step 630, the block is classified as UNIFORM at step 640. If y_0 < d_s < y_1 in step 632, the block is classified as SMOOTH at step 650. If y_1 < d_s < y_2 as indicated in step 634, the block is classified as a WEAK EDGE at step 660. If d_s > y_2 at step 634, the block will be classified as an EDGE at step 670.
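
The decision structure of Figure 7 can be sketched as a single function. Several details are assumptions made for the sake of a runnable example: the tolerance `tol` that defines "approximately equal", the threshold values, and the treatment of a distance exactly equal to a threshold (assigned here to the higher class, since the text only gives strict inequalities).

```python
def classify_block(d, d_s, x, y, tol=0.1):
    """Initial class from the distances d (original) and d_s (smoothed).

    x = (x_0, x_1, x_2) are the contone thresholds, y = (y_0, y_1, y_2)
    the halftone thresholds. The block is treated as contone when d and
    d_s differ by at most tol times the larger of the two (assumed test).
    """
    is_contone = abs(d - d_s) <= tol * max(abs(d), abs(d_s))
    value, (t0, t1, t2) = (d, x) if is_contone else (d_s, y)
    if value < t0:
        return "UNIFORM"
    if value < t1:
        return "SMOOTH"
    if value < t2:
        return "WEAK EDGE"
    return "EDGE"


x = (5, 20, 60)  # hypothetical contone thresholds
y = (4, 15, 50)  # hypothetical halftone thresholds
contone_class = classify_block(100, 98, x, y)
halftone_class = classify_block(255, 15, x, y)
```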

[0022] Referring now to Figures 8 and 9, the details for updating the context of the block will now be provided. The context of a block is useful when the average between the dark and bright areas of the block is relatively high. When this is the case, the block can be classified as an EDGE as long as its context is TEXT. The context is initially set equal to PICTURE. It is changed to TEXT if one of two rules is satisfied: (1) the block being processed is in a long run of UNIFORM blocks and the average of the dark pixel values in the block is greater than a preset brightness threshold; or (2) the block has been classified as either UNIFORM, WEAK EDGE, or EDGE, one of the top, left or right neighboring blocks has a context which has been set equal to TEXT, and the difference between that neighboring block and the current block is smaller than a preset propagation threshold.

[0023] Turning first to Figure 8, determining whether block context should be changed according to the first rule requires finding a run of blocks that have been classified as UNIFORM as indicated in step 704. Finding a run of UNIFORM blocks typically involves comparing the number of consecutive UNIFORM blocks to a run length threshold, as indicated in step 706. The run length threshold sets the number of consecutive blocks that must be classified as UNIFORM for a run to be established. As also indicated in step 706, v_DPIXEL, the average value of the dark pixels for consecutive blocks, is compared to the brightness threshold t_s. A large number of consecutive UNIFORM blocks with high brightness levels usually indicates that the blocks contain large background page areas (i.e. large white areas), thereby indicating that text is present. Thus, if the number of consecutive UNIFORM blocks exceeds the run length threshold and v_DPIXEL > t_s, the context for the block is changed to TEXT as indicated in step 708.

[0024] If either the number of identified consecutive blocks is too small to establish a run or the blocks are dark (v_DPIXEL ≤ t_s), the context will remain set equal to PICTURE. Whether additional runs are present in the block will be determined as indicated in step 710, and if so the process will be repeated as indicated in the illustration.
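
The first context rule can be sketched over one row of blocks. The sketch assumes that blocks are scanned left to right, that every block in a qualifying run must have a dark-pixel average above the brightness threshold, and that all blocks of such a run receive the TEXT context; these readings, the parameter names and the sample values are illustrative assumptions.

```python
def apply_run_rule(classes, dark_means, run_threshold, brightness_threshold):
    """Return a context per block: TEXT inside long bright UNIFORM runs.

    classes: initial class of each block in scan order.
    dark_means: dark-pixel average (v_DPIXEL) of each block.
    A run qualifies when it has more than run_threshold blocks and every
    dark mean in it exceeds brightness_threshold.
    """
    contexts = ["PICTURE"] * len(classes)  # step 702: default context
    i = 0
    while i < len(classes):
        if classes[i] != "UNIFORM":
            i += 1
            continue
        j = i
        while j < len(classes) and classes[j] == "UNIFORM":
            j += 1  # extend the run of UNIFORM blocks (step 704)
        long_enough = (j - i) > run_threshold
        bright = all(dark_means[k] > brightness_threshold for k in range(i, j))
        if long_enough and bright:  # step 706
            for k in range(i, j):
                contexts[k] = "TEXT"  # step 708
        i = j
    return contexts


classes = ["UNIFORM"] * 4 + ["EDGE"] + ["UNIFORM"] * 2
dark_means = [240, 242, 238, 241, 30, 250, 250]
contexts = apply_run_rule(classes, dark_means, run_threshold=3,
                          brightness_threshold=200)
```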
[0025] Turning now to Figure 9, changing the context of a block to TEXT under the second rule first requires providing a propagation threshold t_p. The propagation threshold defines the level of brightness that will indicate that the block covers blank page areas. Under the second rule, the context will be changed from PICTURE to TEXT at step 808 if the block is not SMOOTH (i.e. is UNIFORM, an EDGE or a WEAK EDGE) as shown in step 802, either its top, left or right neighbor has a TEXT context as indicated in step 804, and v_BDIF, the average difference between bright pixels in the block and bright pixels in the neighbor TEXT context block, is less than t_p as shown in step 806. Neighbor blocks are checked because presumably blocks that contain text will be located next to other blocks that contain text. However, the brightness value of the block is compared to that of its neighbor to assure that this is the case. In other words, even if the block has a neighboring block with a TEXT context, a large difference between the average brightness of the block and its neighbor means that the block does not contain the large blank page areas that indicate the presence of text.
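
The second rule reduces to three checks per block. In the sketch below, the neighbor contexts and the bright-pixel differences are passed in as dictionaries keyed by side; that data layout, the function name `apply_neighbor_rule` and the sample numbers are assumptions for illustration only.

```python
def apply_neighbor_rule(block_class, context, neighbor_contexts,
                        bright_diffs, t_p):
    """Propagate a TEXT context from the top, left or right neighbor.

    neighbor_contexts: side -> context of that neighbor (missing sides
    are simply absent). bright_diffs: side -> v_BDIF, the average
    bright-pixel difference between the block and that neighbor.
    """
    if block_class == "SMOOTH":  # step 802: rule applies to non-SMOOTH only
        return context
    for side in ("top", "left", "right"):
        is_text = neighbor_contexts.get(side) == "TEXT"  # step 804
        if is_text and bright_diffs[side] < t_p:         # step 806
            return "TEXT"                                # step 808
    return context


new_context = apply_neighbor_rule(
    "WEAK EDGE", "PICTURE",
    neighbor_contexts={"top": "PICTURE", "left": "TEXT"},
    bright_diffs={"top": 3, "left": 4},
    t_p=10,
)
```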

[0026] Again, the present invention is directed to segmenting the data by first identifying blocks that contain the edges of the image and then separating the blocks such that those which contain the smooth data and bright sides of the edges are placed on the lower plane and the dark sides of the edges are placed on the upper plane. Once each of the respective planes is generated, ordinary MRC processing continues. That is, each plane is compressed using an appropriate compression technique. In the currently preferred embodiment, upper plane 12 and lower plane 14 are compressed using JPEG while the selector plane 16 is compressed using a symbol based pattern matching technique such as CCITT Group IV or a method of classifying scanned symbols into equivalence classes such as that described in US-A-5,778,095 to Davies issued July 7, 1998, the contents of which are hereby incorporated by reference. The planes are then joined together and transmitted to an output device, such as a facsimile machine or storage device.


Claims 



1. A method of segmenting digital image data for mixed raster content processing, comprising:

a) acquiring a block of the digital image data, wherein the digital image data is composed of light intensity signals in discrete locations;

b) designating a classification for said block and providing an indication about a context of said block;

c) segmenting said light intensity signals in said block into an upper subset and a lower subset based upon said designated classification;

d) generating a selector set which tracks said light intensity segmentation; and

e) separately compressing the digital image data contained in said upper and lower subsets.

2. A method of segmenting digital image data as claimed in claim 1, wherein said classification indicates that said block contains substantially smooth data and/or substantially edge data.

3. A method of segmenting digital image data as claimed in claim 1 or claim 2, wherein said classification data designating step further comprises:

a) measuring an amount of light intensity signal variation throughout said block;

b) assigning a classification to said block based upon said measured light intensity signal variation; and

c) updating said context indication for said block, and designating classification for said block based upon said updated context.

4. A method of segmenting digital image data as claimed in any of the preceding claims, further comprising:

a) dividing a low pass filtered version of said block into a bright region and a dark region;

b) calculating average filtered light intensity values for said bright region and for said dark region; and

c) obtaining a difference in average filtered light intensity values between said bright region and said dark region.

5. A method of segmenting a block of digital image data into an upper and lower subset, wherein the block of data is composed of light intensity signals in discrete locations, comprising:

a) determining whether the block is located on an edge in the digital image;

b) if the block is on an edge, comparing a magnitude of each light intensity signal in the block to a brightness threshold and placing said signal in the upper subset if said light intensity magnitude exceeds said brightness threshold or in the lower subset if said light intensity magnitude is less than said brightness threshold; and

c) if the block is not located on an edge, placing the block in the upper subset if the block is in a group of blocks that have light intensity values which are indicative of smooth and dark image data, and otherwise placing the block in the lower subset.

6. A method of classifying a block of digital image data into one of a plurality of image data types, wherein the block of data is composed of light intensity signals in discrete locations, comprising:

a) dividing the block into a bright region and a dark region;

b) dividing a low pass filtered version of said block into a bright region and a dark region;

c) calculating average light intensity values for each of said bright region, said dark region, said filtered bright region and said filtered dark region; and

d) comparing a difference between said bright region and said dark region average light intensity values to a filtered difference between said bright region and said dark region average filtered light intensity values;

e) if said average light intensity difference and said average filtered light intensity difference are approximately equal, finding a range of values in which said difference value falls, and classifying said block based upon said value range; and

f) if said average light intensity difference and said average filtered light intensity difference are not approximately equal, finding a range of values in which said filtered difference value falls and classifying said block based upon said filtered value range.

7. A method according to any of claims 1 to 4, wherein blocks are classified by a method according to claim 5 or claim 6.






EP 1 006 716 A2

FIG. 1

FIG. 3 (flow chart): 210 GET NEW BLOCK; 220 CLASSIFY BLOCK; 230 SEGMENT PIXELS IN BLOCK INTO TWO PLANES BASED ON CLASSIFICATION; 250 MORE BLOCKS IN PIXEL MAP?; if NO, 240 COMPRESS BOTH PLANES.

FIG. 4 (step 220, from 210): 310 MEASURE VARIATION OF VALUES IN BLOCK; 320 CLASSIFY BLOCK BASED UPON AMOUNT OF VARIATION; 330 UPDATE CONTEXT PORTION OF BLOCK CLASSIFICATION AND PERFORM NECESSARY RE-CLASSIFICATIONS; to 230.

FIG. 6 (step 310): FIND t_s; 540 CALCULATE v_BPIXEL, THE AVERAGE FOR BRIGHT PIXELS IN THE PIXEL MAP; 560 CALCULATE v_BSMOOTH, THE AVERAGE FOR BRIGHT PIXELS IN THE SMOOTHED IMAGE; 550 CALCULATE v_DPIXEL, THE AVERAGE FOR DARK PIXELS IN THE PIXEL MAP; 570 CALCULATE v_DSMOOTH, THE AVERAGE FOR DARK PIXELS IN THE SMOOTHED IMAGE; 580 CALCULATE d = v_BPIXEL - v_DPIXEL and d_s = v_BSMOOTH - v_DSMOOTH; to step 320.

FIG. 7 (step 320, from 310): decisions 620, 622 and 624 lead to UNIFORM, SMOOTH (650), WEAK EDGE (660) or EDGE; each result continues to 330.

FIG. 8 (step 330, from 320): 702 SET CONTEXT = PICTURE FOR ALL BLOCKS; 704 FIND A RUN OF UNIFORM BLOCKS; 706 (decision); if YES, 708 SET CONTEXT = TEXT FOR BLOCK; 710 MORE RUNS OF UNIFORM BLOCKS?; if NO, to 230.

FIG. 9 (continues to 230)

(19) Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) EP 1 006 716 A3

(12) EUROPEAN PATENT APPLICATION

(88) Date of publication A3: 19.09.2001 Bulletin 2001/38

(43) Date of publication A2: 07.06.2000 Bulletin 2000/23

(21) Application number: 99309522.3

(22) Date of filing: 29.11.1999

(51) Int Cl.7: H04N 1/64

(84) Designated Contracting States: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
Designated Extension States: AL LT LV MK RO SI

(30) Priority: 02.12.1998 US 203870

(71) Applicant: Xerox Corporation
Rochester, New York 14644 (US)

(72) Inventors:
• Fan, Zhigang
Webster, NY 14580 (US)
• Xu, Ming
Rochester, NY 14618 (US)

(74) Representative: Skone James, Robert Edmund
GILL JENNINGS & EVERY
Broadgate House
7 Eldon Street
London EC2M 7LH (GB)










EUROPEAN SEARCH REPORT

Application Number: EP 99 30 9522

DOCUMENTS CONSIDERED TO BE RELEVANT

Category: A; P,A; A,D
Relevant to claim: 1,5,6

Citation of document with indication, where appropriate, of relevant passages:
US 5 767 978 A (FAN ZHIGANG ET AL), 16 June 1998 (1998-06-16), * abstract; claims; figures *
EP 0 358 815 A (OCE NEDERLAND BV), 21 March 1990 (1990-03-21), * abstract *
US 5 014 124 A (FUJISAWA TETSUO), 7 May 1991 (1991-05-07), * abstract *
US 5 949 555 A (SAKAI AKIHIKO ET AL), 7 September 1999 (1999-09-07), * abstract *
US 5 778 092 A (VINCENT LUC ET AL), 7 July 1998 (1998-07-07)

CLASSIFICATION OF THE APPLICATION (Int.Cl.7): H04N1/64
TECHNICAL FIELDS SEARCHED (Int.Cl.7): H04N, G06K, G06T

The present search report has been drawn up for all claims.

Place of search: THE HAGUE
Date of completion of the search: 1 August 2001
Examiner: Isa, S

CATEGORY OF CITED DOCUMENTS
X : particularly relevant if taken alone
Y : particularly relevant if combined with another document of the same category
A : technological background
O : non-written disclosure
T : theory or principle underlying the invention
E : earlier patent document, but published on, or after the filing date
D : document cited in the application
L : document cited for other reasons
& : member of the same patent family, corresponding document

ANNEX TO THE EUROPEAN SEARCH REPORT ON EUROPEAN PATENT APPLICATION NO. EP 99 30 9522

This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report. The European Patent Office is in no way liable for these particulars which are merely given for the purpose of information.

01-08-2001

Patent document cited in search report (publication date): patent family member(s) (publication date)

US 5767978 A (16-06-1998): EP 0855679 A (29-07-1998)
EP 0358815 A (21-03-1990): DE 3881392 D, DE 3881392 T, JP 2105978 A, JP 2818448 B, US 5073953 A (member publication dates [...])
US 5014124 A (07-05-1991): JP 2684879 A, JP 2815157 B (member publication dates [...])
US 5949555 A (07-09-1999): JP 7220691 A
US 5778092 A (07-07-1998): NONE

For more details about this annex: see Official Journal of the European Patent Office, No. 12/82